Stabilizing variable selection and regression


Abstract

We consider regression in which one predicts a response Y with a set of predictors X across different experiments or environments. This is a common setup in many data-driven scientific fields, and we argue that statistical inference can benefit from an analysis that takes into account the distributional changes across environments. In particular, it is useful to distinguish between stable and unstable predictors, that is, predictors which have a fixed or a changing functional dependence on the response, respectively. We introduce stabilized regression, which explicitly enforces stability and thus improves generalization performance to previously unseen environments. Our work is motivated by an application in systems biology. Using multiomic data, we demonstrate how hypothesis generation about gene function can benefit from stabilized regression. We believe that a similar line of arguments for exploiting heterogeneity in data can be powerful for many other applications as well. We draw a theoretical connection between multi-environment regression and causal models, which allows us to graphically characterize stable vs. unstable functional dependence on the response. Formally, we introduce the notion of a stable blanket, which is a subset of the predictors that lies between the direct causal predictors and the Markov blanket. We prove that this set is optimal in the sense that a regression based on these predictors minimizes the mean squared prediction error, given that the resulting regression generalizes to new, unseen environments.


Similar resources

Variable Selection in ROC Regression

Regression models are introduced into the receiver operating characteristic (ROC) analysis to accommodate effects of covariates, such as genes. If many covariates are available, the variable selection issue arises. The traditional induced methodology separately models outcomes of diseased and nondiseased groups; thus, separate application of variable selections to two models will bring barriers...


Variable Selection for Regression Models

A simple method for subset selection of independent variables in regression models is proposed. We expand the usual regression equation to an equation that incorporates all possible subsets of predictors by adding indicator variables as parameters. The vector of indicator variables dictates which predictors to include. Several choices of priors can be employed for the unknown regression coefficie...


Variable Selection in Quantile Regression

Since its inception in Koenker and Bassett (1978), quantile regression has become an important and widely used technique for studying the whole conditional distribution of a response variable, and over the last three decades it has grown into an important tool of applied statistics. In this work, we focus on the variable selection aspect of penalized quantile regression. Under some mild conditions, we demo...


Variable Selection in Semiparametric Regression Modeling

In this paper, we are concerned with how to select significant variables in semiparametric modeling. Variable selection for semiparametric regression models consists of two components: model selection for nonparametric components and selection of significant variables for the parametric portion. Thus, semiparametric variable selection is much more challenging than parametric variable selection ...


Variable Selection for Multivariate Logistic Regression Models

In this paper, we use multivariate logistic regression models to incorporate correlation among binary response data. Our objective is to develop a variable subset selection procedure to identify important covariates in predicting correlated binary responses using a Bayesian approach. In order to incorporate available prior information, we propose a class of informative prior distributions on th...


Journal

Journal title: The Annals of Applied Statistics

Year: 2021

ISSN: 1941-7330, 1932-6157

DOI: https://doi.org/10.1214/21-aoas1487